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Version: 13.0 


Case Study: 1 


Relecloud 
General Overview 


Relecloud is a social media company that processes hundreds of millions of social 
media posts per day and sells advertisements to several hundred companies. 


Relecloud has a Microsoft SQL Server database named DB1 that stores information 
about the advertisers. DB1 is hosted on a Microsoft Azure virtual machine. 


Physical locations 


Relecloud has two main offices. The offices we located in San Francisco and New 
York City. 


The offices connected to each other by using a site-to-site VPN. Each office 
connects directly to the Internet. 


Business model 


Relecloud modifies the pricing of its advertisements based on trending topics. Topics 
are considered to be trending if they generate many mentions in a specific country 
during a 15-minute time frame. The highest trending topics generate the highest 
advertising revenue. 


CTO statement 


Relecloud wants to deliver reports lo the advertisers by using Microsoft Power BI. 
The reports will provide real-time data on trending topics, current advertising rates, 
and advertising costs for a given month. 


Relecloud will analyze the trending topics data, and then store the data in a new 
data warehouse for ad-hoc analysis. The data warehouse is expected to grow at a 
rate of 1 GB per hour or 8.7 terabytes (TB) per year. The data will be retained for 
five years for the purpose of long term trending. 


Requirements 
Business goals 


Management at Relecloud must be able to view which topics are trending to adjust 
advertising rates in near real-time. 
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Planned changes 


Relecloud plans to implement a new streaming analytics platform that will report on 
trending topics. Relecloud plans to implement a data warehouse named DB2. 


General technical requirements 


Relecloud identifies the following technical requirements: 


° Social media data must be analyzed to identify trending topics in real 
time. 

® The use of Infrastructure as a Service (laaS) platforms must 
minimized, whenever possible. 

° The real-time solution used to analyze the social media data must 


support selling up and down without service interruption. 
Technical requirements for advertisers 
Relecloud identifies the following technical requirements for the advertisers 


e The advertisers must be able to see only their own data in the Power 
BI reports. 

° The advertisers must authenticate to Power BI by using Azure Active 
Directory (Azure AD) credentials. 

° The advertisers must be able to leverage existing Transact-SQL 
language knowledge when developing the real-time streaming solution. 

° Members of the internal advertising sales team at Relecloud must be 
able to see only the sales data of the advertisers to which they are assigned. 
° The Internal Relecloud advertising sales team must be prevented from 
inserting, updating, and deleting rows for the advertisers to which they are not 
assigned. 

® The internal Relecloud advertising sales team must be able to use a 
text file to update the list of advertisers, and then to upload the file to Azure 
Blob storage. 


DB1 requirements 


Relecloud identifies the following requirements for DB1: 


e Data generated by the streaming analytics platform must be stored in 
DB1. 

° The user names of the advertisers must be mapped to CustomerID in 
a table named Table2. 

° The advertisers in DB1 must be stored in a table named Table1 and 
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must be refreshed nightly. 
e The user names of the employees at Relecloud must be mapped to 
EmployeelD in a table named Table3. 


DB2 requirements 


Relecloud identifies the following requirements for DB2: 


e DB2 must have minimal storage costs. 

e DB2 must run load processes in parallel. 

e DB2 must support massive parallel processing. 

° DB2 must be able to store more than 40 TB of data. 

° DB2 must support scaling up and down, as required. 

e Data from DB1 must be archived in DB2 for long-term storage. 

° All of the reports that are executed from DB2 must use aggregation. 

° Users must be able to pause DB2 when the data warehouse is not in 
use. 

° Users must be able to view previous versions of the data in DB2 by 


using aggregates. 
ETL requirements 


Relecloud identifies the following requirements for extract, transformation, and load 
(ETL): 


° Data movement between DB1 and DB2 must occur each hour. 
° An email alert must be generated when a failure of any type occurs 
during ETL processing. 


ris_table1 


You execute the following code for a table named rls_table1. 
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create function rls tablel (@CustomerId int, @SalesPersonId int) 
returns table 
with schemabinding 
as 
return 
select 1 as result 
from dbo.tablel 
join dbo.table2 on tablel.customerid = Table2.CustomerId 
where table2.UserName = suser_sname() 
and tablel.customerid =@CustomerId 
union ail 
select 1 as result 
from dbo.tablel 
join dbo.table3 on tablel.salespersonid = table3.Employeelid 
where table3.UserName = suser_sname () 
and tablel.salespersonid = @SalesPersonId 
go 


dbo.table1 
You use the following code to create Table1. 


create table tablei 
(customerid int, 
selespersonid int 

) 

Go 


Streaming data 


The following is a sample of the Streaming data. 


User Country Topic Time 

useri USA Topicl 2017-01-01T00:00:01.00000002 
userl USA Topic3 2017-01-01T00:02:01.00000002 
user2 Canada Topic2 2017-01-01T00:01:11.00000002 
user3 India Topici 2017-01-01T00:03:14.00000002 
Question: 1 


Which technology should you recommend to meet the technical requirement for 
analyzing the social media data? 
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A. Azure Stream Analytics 
B. Azure Data Lake Analytics 
C. Azure Machine Learning 


D. Azure HDInsight Storm clusters 


Answer: A 


Azure Stream Analytics is a fully managed event-processing engine that lets you set 
up real-time analytic computations on streaming data. 

Scalability 

Stream Analytics can handle up to 1 GB of incoming data per second. Integration 
with Azure Event Hubs and Azure loT Hub allows jobs to ingest millions of events 
per second coming from connected devices, clickstreams, and log files, to name a 
few. Using the partition feature of event hubs, you can partition computations into 
logical steps, each with the ability to be further partitioned to increase scalability. 


Question: 2 


DRAGDROP 


You need to implement a solution that meets the data refresh requirement for DB1. 
Which three actions should you perform in sequence? To answer, move the 
appropriate actions from the list of actions to the answer area and arrange them in 
the correct order. 
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Actions Answer Area 


In DBI, create external objects. 

From the Azure portal, export the 

storage account key. 

In DB1, create a stored procedure that (>) 
imports data from an external table to 


Table. 


From the Azure portal, create and 
schedule an Azure Automation job 
that executes the stored procedure. 


In DB1, create a staging table. 


©O 


Answer: 


Answer Area 
In DB1, create a stored procedure that | 
imports data from an external table to 
Table1. 


From the Azure portal, export the 


e account key. 


[From the Azure portal, create and 
| schedule an Azure Automation job 
that executes the stored procedure. 


Azure Data Factory can be used to orchestrate the execution of stored procedures. 
This allows more complex pipelines to be created and extends Azure Data Factory's 
ability to leverage the computational power of SQL Data Warehouse. 

From scenario: 

Relecloud has a Microsoft SQL Server database named DB1 that stores information 
about the advertisers. DB1 is hosted on a Microsoft Azure virtual machine. 


Relecloud identifies the following requirements for DB1: 
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e Data generated by the streaming analytics platform must be stored in DB1. 


e The advertisers in DB1 must be stored in a table named Table1 and must be 
refreshed nightly. 


Question: 3 


DRAG DROP 


You need to create a query that identifies the trending topics. 

How should you complete the query? To answer, drag the appropriate values to the 
correct targets. Each value may be used once, more than once, or not at all. You 
may need to drag the split bar between panes or scroll to view content. 

NOTE: Each correct selection is worth one point. 


LE EEE] 
Answer Area 
DATETIME SELECT Country, Topic, count(*) 
GROUP BY FROM Input | Value | BY Time 


Value Country, Topic, Value (minute, 15) 


< 
pas c 
oO o 
m ” 
a 
oa 
< 
000o 


Answer: 


Answer Area 


SELECT Country, Topic, count(*) 


BY Time 


[ [sroursy | Country, Topic, [Slidingwindow (minute, 15) 


FROM Input | | TIMESTAMP 
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From scenario: Topics are considered to be trending if they generate many mentions 
in a specific country during a 15-minute time frame. 

Box 1: TimeStamp 

Azure Stream Analytics (ASA) is a cloud service that enables real-time processing 
over streams of data flowing in from devices, sensors, websites and other live 
systems. The stream-processing logic in ASA is expressed in a SQL-like query 
language with some added extensions such as windowing for performing temporal 
calculations. 

ASA is a temporal system, so every event that flows through it has a timestamp. A 
timestamp is assigned automatically based on the event's arrival time to the input 
source but you can also access a timestamp in your event payload explicitly using 
TIMESTAMP BY: 

SELECT * FROM SensorReadings TIMESTAMP BY time 

Box 2: GROUP BY 

Example: Generate an output event if the temperature is above 75 for a total of 5 
seconds 

SELECT sensorld, MIN(temp) as temp 

FROM SensorReadings 

TIMESTAMP BY time 

GROUP BY sensorld, SlidingWindow(second, 5) 

HAVING MIN(temp) > 75 

Box 3: SlidingWindow 

Windowing is a core requirement for stream processing applications to perform set- 
based operations like counts or aggregations over events that arrive within a 
specified period of time. ASA supports three types of windows: Tumbling, Hopping, 
and Sliding. 

With a Sliding Window, the system is asked to logically consider all possible 
windows of a given length and output events for cases when the content of the 
window actually changes — that is, when an event entered or existed the window. 


Question: 4 


HOTSPOT 


You implement DB2. 

You need to configure the tables in DB2 to host the data from DB1. The solution 
must meet the requirements for DB2. 

Which type of table and history table storage should you use for the tables? To 
answer, select the appropriate options in the answer area. 

NOTE: Each correct selection is worth one point. 
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Answer area 


Table: 


Change Data Capture 
Change tracking 
Temporal table 


History table storage: 
Clustered columnstore 
In-Memory OLTP 
Row store 


Answer: 


Table: 


Change Data Capture 
Change tracking 


- 


History table storage: 


e 


Viemory C 


From Scenario: Relecloud plans to implement a data warehouse named DB2. 

Box 1: Temporal table 

From Scenario: 

Relecloud identifies the following requirements for DB2: 

Users must be able to view previous versions of the data in DB2 by using 
aggregates. 

DB2 must be able to store more than 40 TB of data. 

A system-versioned temporal table is a new type of user table in SQL Server 2017, 
designed to keep a full history of data changes and allow easy point in time analysis. 
A temporal table also contains a reference to another table with a mirrored schem 

a. The system uses this table to automatically store the previous version of the row 
each time a row in the temporal table gets updated or deleted. This additional table 
is referred to as the history table, while the main table that stores current (actual) 
row versions is referred to as the current table or simply as the temporal table. 
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Question: 5 


DRAG DROP 


You need to implement rls_table1. 

Which code should you execute? To answer, drag the appropriate values to the 
correct targets. Each value may be used once, more than once, or not at all. You 
may need to drag the split bar between panes or scroll to view content. 

NOTE: Each correct selection is worth one point. 


eeesoe 
Values Answer Area 
CREATE | Value | POLICY dbo.rls_table1_policy 
ADD Value PREDICATE dbo.rls_table1(Customerld, salespersonid) on dbo.table1, 
e ADD| Vale | PREDICATE dbo.rls_table1(Customerld, salespersonid) on dbo.table1 BEFORE UPDATE, 
: ADD [| Value | PREDICATE dbo.rls_table1(Customerld, salespersonid) on dbo.tablel BEFORE DELETE, 
ADD Value | PREDICATE dbo.rls_table1(Customerld, salespersonid) on dbo.table1 AFTER INSERT 
with ( state = on ) 
Answer: 
Answer Area 


CREATE | POLICY dbo.rls_table1_policy 

ADD | PREDICATE dbo.rls_table1(Customerld, salespersonid) on dbo.table1, 

ADD [Block | | PREDICATE dbo.rls_table1(Customerld, salespersonid) on dbo.tablel BEFORE UPDATE, 
ADD | PREDICATE dbo.rls_table1(Customerid, salespersonid) on dbo.table1 DELETE, 
ADD | PREDICATE dbo.rls_table1(Customerid, salespersonid) on dbo.table1 AFTER INSERT 
with ( state = on ) 


Box 1: Security 
Security Policy 
Example: After we have created Predicate function, we have to bind it to the table, 


using Security Policy. We will be using CREATE SECURITY POLICY command to 
set the security policy in place. 
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CREATE SECURITY POLICY DepartmentSecurityPolicy 

ADD FILTER PREDICATE dbo.DepartmentPredicateFunction(UserDepartment) ON 
dbo.Department 

WITH(STATE = ON) 

Box 2: Filter 

[ FILTER | BLOCK ] 

The type of security predicate for the function being bound to the target table. 
FILTER predicates silently filter the rows that are available to read operations. 
BLOCK predicates explicitly block write operations that violate the predicate 
function. 

Box 3: Block 

Box 4: Block 

Box 5: Filter 


Question: 6 


HOTSPOT 


Which service solution and which table storage solution should you recommend for DB2? To answer, 
select the appropriate options in the answer area. 
NOTE: Each correct selection is worth one point. 


Service: 


An Azure virtual machine that has SQL Server installed 


Azure SQL Data Warehouse 
Azure SQL Database 


Table storage: 


Clustered columnstore index 


Clustered index 
In-Memory OLTP 


Answer: 
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Service: 


An Azure virtual machine that has SQL Server installed 


Azure SQL Data Warehouse 
Azure SQL Database 


Clustered columnstore index ahd 


Table storage: 


Clustered index 
In-Memory OLTP 


Explanation: 

Box 1: Azure SQL Data Warehourse 

Scenario: Relecloud plans to implement a data warehouse named DB2. 

Box 2: Clustered Columnstore index 

Columnstore index is a new type of index introduced in SQL Server 2012. It is a column-based non- 
clustered index geared toward increasing query performance for workloads that involve large amounts 
of data, typically found in data warehouse fact tables. 

A clustered columnstore index is the physical storage for the entire table. 

Scenario: 

Relecloud identifies the following requirements for DB2: 

DB2 must be able to store more than 40 TB of data. 

References: 


https://docs.microsoft.com/en-us/sql/relational-databases/indexes/columnstore-indexes-overview 


Question: 7 


HOTSPOT 


You need to configure the alert to meet the requirements for ETL. 

Which settings should you use for the alert? To answer, select the appropriate options in the answer 
area. 

NOTE: Each correct selection is worth one point. 
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Event: 


Activity Run Started 
(On-Demand HDI Cluster Create Start 
(On-Demand HDI Cluster Created Successfully 
(On-Demand HDI Cluster Deleted 


=e 


| Succeeded 


— ¥ 


Abandoned 
Failed Execution 
Failed Resource Allocation 


Status: 


Substatus: 


Failed Validation 
Timed Out 


Activity Run Finished Y 
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Answer: 
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Event: 


Activity Run Finished iv 


(On-Demand HDI Cluster Deleted 


|? 


Succeeded 


Status: 


Substatus: 


Abandoned 
Failed Execution 


Failed Resource Allocation 
Failed Validation 
Timed Out 


Explanation: 
Scenario: Relecloud identifies the following requirements for extract, transformation, and load (ETL): 
An email alert must be generated when a failure of any type occurs during ETL processing. 


Case Study: 3 


Litware, Inc 

Overview 

General Overview 

Litware, Inc. is a company that manufactures personal devices to track physical 
activity and other health-related data. 

Litware has a health tracking application that sends health-related data horn a user's 
personal device to Microsoft Azure. 

Physical Locations 

Litware has three development and commercial offices. The offices are located in 
the Untied States, Luxembourg, and India. 

Litware products are sold worldwide. Litware has commercial representatives in 
more than 80 countries. 

Existing Environment 
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Environment 

In addition to using desktop computers in all of the offices. Litware recently started 
using Microsoft Azure resources and services for both development and operations. 
Litware has an Azure Machine Learning Solution. 

Litware Health Tracking Application 

Litware recently extended its platform to provide third-party companies with the 
ability to upload data from devices to Azure. The data can be aggregated across 
multiple devices to provide users with a comprehensive view of their global health 
activity. 

While the upload from each device is small, potentially more than 100 million 
devices will upload data daily by using an Azure event hub. 

Each health activity has a small amount of data, such as activity type, start 
date/time, and end date/time. Each activity is limited to a total of 3 KB and includes a 
customer Identification key. 

In addition to the Litware health tracking application, the users’ activities can be 
reported to Azure by using an open API. 

Machine Learning Experiments 

The developers at Litware perform Machine Learning experiments to recommend an 
appropriate health activity based on the past three activities of a user. 

The Litware developers train a model to recommend the best activity for a user 
based on the hour of the day. 

Requirements 

Planned Changes 

Litware plans to extend the existing dashboard features so that health activities can 
be compared between the users based on age, gender, and geographic region. 
Business Goals 

Minimize the costs associated with transferring data from the event hub to Azure 
Storage. 

Technical Requirements 

Litware identities the following technical requirements: 

Data from the devices must be stored from three years in a format that enables the 
fast processing of data fields and Filtering. 

The third-party companies must be able to use the Litware Machine learning models 
to generate recommendations to their users by using a third-party application. 

Any changes to the health tracking application must ensure that the Litware 
developers can run the experiments without interrupting or degrading the 
performance of the production environment. 

Privacy Requirements 

Activity tracking data must be available to all of the Litware developers for 
experimentation. The developers must be prevented from accessing the private 
information of the users. 

Other Technical Requirements 

When the Litware health tracking application asks users how they feel, their 
responses must be reported to Azure. 


Question: 8 
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You need to recommend a data handling solution to support the planned 
changes to the dashboard. The solution must meet the privacy requirements. 


What is the best recommendation to achieve the goal? More than one answer 
choice may achieve the goal. Select the BEST answer. 


A. anonymization 
B. encryption 

C. obfuscation 

D. compression 


Answer: C 


Question: 9 


You need to recommend a data transfer solution to support the business goals. 
What should you recommend? 


A. Configure the health tracking application to cache data locally for 24 hours. 

B. Configure the health tracking application to Aggregate activities in blocks of 128 
KB. 

C. Configure the health tracking application to cache data locally tor 12 hours. 

D. Configure the health tracking application to aggregate activities in blocks of 64 
KB. 


Answer: D 


Question: 10 


You need to recommend a permanent Azure Storage solution for the activity data. 
The solution must meet the technical requirements. 

What is the best recommendation to achieve the goal? More than one answer 
choice may achieve the goal. Select the BEST answer. 

A. Azure SQL Database 

B. Azure Queue storage 

C. Azure Blob storage 

D. Azure Event Hubs 


Answer: A 
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Question: 11 


Users report that when they access data that is more than one year old from a 
dashboard, the response time is slow. 

You need to resolve the issue that causes the slow response when visualizing older 
data. What should you do? 

A. Process the event hub data first, and then process the older data on demand. 

B. Process the older data on demand first, and then process the event hub data. 

C. Aggregate the older data by time, and then save the aggregated data to 
reference data streams. 

D. Store all of the data from the event hub in a single partition. 


Answer: C 


Question: 12 


You extend the dashboard of the health tracking application to summarize fields 
across several users. 

You need to recommend a file format for the activity data in Azure that meets the 
technical requirements. 

What is the best recommendation to achieve the goal? More than one answer 
choice may achieve the goal. Select the BEST answer. 


Answer: E 
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